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Abstract —This paper presents a sensor-control method for 
choosing the best next state of the sensor(s), that provide(s) 
accurate estimation results in a multi-target tracking application. 
The proposed solution is formulated for a multi-Bernoulli filter 
and works via minimization of a new estimation error-based 
cost function. Simulation results demonstrate that the proposed 
method can outperform the state-of-the-art methods in terms 
of computation time and robustness to clutter while delivering 
similar accuracy. 

Index Terms —multi-Bernoulli filter, random finite sets, PHD 
filter, sensor-control. 

I. Introduction 

ENSOR-control techniques in multi-target tracking prob¬ 
lems are designed to find the optimal control command 
from a set of admissible commands that results in the most 
rewarding observations. Sensor-control techniques usually in¬ 
volve addressing two main components: 

- multi-object filtering, and 

- sequential decision making. 

To solve the above problems, we formulate the sensor-control 
problem in the partially observed Markov decision process 
(POMDP) framework by proposing to use a multi-Bernoulli 
filter for the multi-object filtering task. This filter enables us 
to devise a new cost function which has a more direct rela¬ 
tion with the OSPA [36] multi-target miss-distance compared 
to information-theoretic-based objective functions commonly 
used in sensor-control solutions. Consequently, it results in 
better performance (in terms of the OSPA error). 

The purpose of multi-object filtering is to jointly estimate 
the randomly varying number and states of multiple objects. 
Several solutions for this problem have recently been devel¬ 
oped in the finite set statistics (FISST) framework, in which 
the multi-object entity is treated as a random finite set (RFS) 
with its distribution being predicted and updated in every 
time step [29]. Examples of such solutions include the PHD 
filter [25], [29], CPHD filter [28], [42], MeMBer filter [29], 
cardinality balanced MeMBer (CB-MeMBer) filter [43] and 
Generalized labeled multi-Bernoulli filter [41]. 

A number of sensor-control solutions have also been devel¬ 
oped to work within the FISST multi-target filtering schemes 
[9]—[12], [24], [26], [30], [34], [35], [45], [47], In these solu¬ 
tions, a criterion is defined to evaluate the expected quality of 
the updated multi-target density after an admissible command 
is applied to the sensor. In this approach, the chosen control 

All authors are with School of Aerospace, Mechanical and Manufacturing 
Engineering, RMIT University, Victoria 3083, Australia. 

This work was supported by the ARC Discovery Project grant 
DP130104404. 


command is the one that provides the best updated density in 
terms of the defined criterion. Examples of this type of criteria 
include the Csiszar information-theoretic functional [27], pos¬ 
terior expected number of targets (PENT) [30], [47] and the 
Cauchy-Schwarz divergence between the probability densities 
of two Poisson point processes [13]. 

The most common criterion to evaluate the quality of 
the updated distribution is its divergence from the predicted 
distribution [34], In two consecutive papers, Ristic et al. [34], 
[35] used Renyi divergence as a reward function to quantify 
the information gained via updating the predicted density in 
a random set filtering framework. In [34], the implementa¬ 
tion of Renyi reward maximization was investigated for the 
“general form” of multi-target filters. Since this approach 
is computationally intractable even for a small number of 
targets, in the second paper [35], the Renyi divergence function 
was approximated based on i.i.d. cluster assumption for the 
multi-target distribution within a PHD filtering scheme. Later, 
in [12] and [14] the Renyi divergence objective function was 
implemented via the multi-Bernoulli filter. 

In our recent study [9], reported independently from [12] 
and [14], we introduced the use of multi-Bernoulli filtering in 
a task-driven objective function. This criterion is introduced 
in the form of a cost that would be minimized over all 
admissible control commands. It is important to note that 
in multi-Bernoulli filter, estimation of the number and states 
of targets from the updated multi-Bernoulli distribution is 
straightforward and no clustering is needed. Furthermore, 
multi-Bernoulli filters have shown to generally provide ef¬ 
fective solutions for multi-target tracking methods in various 
multi-target applications using radar, video and audio-visual 
measurements [15]—[18], [40], [43], We also note that both 
in our earlier sensor-control method, and in the method intro¬ 
duced in this paper, the cardinality-balanced multi-Bernoulli 
filter (CB-MeMBer) [43] is used as an appropriate multi-object 
filtering scheme for the sensor-control method. CB-MeMBer is 
proven to result in unbiased cardinality estimates under high 
SNR [43] . 

In our previous work [9], the cost function was defined 
in terms of the estimation error of only the cardinality of 
the multi-target state with the assumption that in practice, an 
accurate estimate of the number of target would be followed 
by accurate estimates of the actual target states. Around the 
same time, a similar approach based on using a variance-based 
cost function was also pursued by Hoang et al. [14]. Unlike the 
cost function presented in [9], in which the statistical variance 
of cardinality around its mean is chosen as the cost function, 
Hoang et al. [14] used the variance of cardinality around its 
MAP estimate as the cost function. 
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In this paper, we present a new cost function called Posterior 
Expected Error of Cardinality and States (PEECS) that its 
minimization can lead to a high quality updated density 
even with limited observability. The proposed cost function is 
intended to account for both localization and cardinality errors. 
Inclusion of the extra terms for localization errors in PEECS 
(compared to cardinality variance-only costs used in [9], [12]) 
can enhance the performance of sensor-control, especially in 
challenging cases with numerous targets and moderate to high 
rates of clutter. 

The rest of paper is organized as follows. In Sec. II, a 
formal statement of the sensor-control problem is presented. 
Since, our sensor-control solution is specifically formulated to 
work within multi-Bernoulli filtering schemes, the cardinality- 
balanced multi-Bernoulli filter is briefly reviewed in Sec. III. 
Then, the details of sensor-control method are presented in 
Sec. IV followed by comparative numerical studies in Sec. VI. 
Sec. VII concludes the paper. 

II. Problem Statement: sensor-control 

This paper focuses on developing an effective sensor-control 
solution for applications where mobile sensors are used for 
multi-object tracking while only instantaneous performance is 
considered (myopic approach). 

The main aim of sensor-control is to choose the best control 
command(s) from a number of admissible commands. The 
application of the chosen command changes the sensor(s) state 
e.g. the location, heading or both to reach more accurate 
estimation of the number and states of targets via a set of 
noisy measurements which may include by clutter. The quality 
of a sensor-control command is usually evaluated in terms of 
a reward function (to be maximized) or a cost function (to 
be minimized). Following [3], [34], [35], we formulated the 
sensor-control problem in the Partially Observable Markov 
Decision Process (POMDP) framework. POMDP is a general¬ 
ized form of Markov Decision Process (MDP) [19] in which 
there is no direct access to the slates and the states information 
are only realized by noisy observations. The POMDP frame¬ 
work employed in this paper to formulate the sensor-control 
problem includes the following elements at any time k : 

- a finite set X k comprising single-object states; 

- a set of sensor-control commands (actions) U; 

- a stochastic model for single-target state transition 

fk\k-l(% k\% k-l) •> 

- a finite set of observations Z, generally made by N 
sensors 1 ; 

- a stochastic measurement model gk(z\x); and 

- a function V( u: Xk) that returns a reward or cost associ¬ 
ated with action command «tO. 

In principle in a POMDP framework, the action space is infinite 
and continuous. To reduce computational complexity, we as¬ 
sume that the sensor(s) can choose from a finite set of actions 
or commands, namely admissible control commands [1]. 

1 Without loss of generality, in our formulations and simulation studies, we 
consider multiple observations from one sensor and note that extension of the 
proposed solution to the general multi-sensor case is straightforward. 


The purpose of sensor-control is to find the control com¬ 
mand u which optimizes the cost (or reward) function. In 
stochastic filtering, where the multi-target states Xk-\ and Xk 
are characterized by their distributions, the control command 
u is commonly chosen to optimize the statistical mean of 
the objective function V(it; Xk) over all observations. The 
function V(u; Xk ) is a real-valued objective function of the 
control command and estimated state of the targets. Depending 
on the nature of this function (being either a ” cost ” or 
a ” reward ”), the sensor-control solution would minimize or 
maximize it, 

u k = argmin/argmax{E Zfc(u ) [V(u; X fc )]} . (1) 

•ueU iteU 

The objective function usually depends on the statistical 
distribution of Xk-i and X k , which can be recursively com¬ 
puted in a Bayesian filtering scheme. The latest development 
in multi-target filtering is Mahler’s finite set statistics (FISST) 
framework which provides a set of mathematical tools that 
allows direct application of Bayesian inferencing to multi¬ 
target problems. In a seminal paper [39], Vo et al. established 
the relationship between FISST and conventional probability. 
The theory provides a rigorous paradigm for calculations 
involving RFSs based upon the notion of integration and 
density in point process and stochastic geometry [6], [31], 
[37], This approach to multi-target filtering has influenced a 
variety of research areas such as autonomous vehicles and 
robotics [32], radar tracking [21] and image processing [23], 
Since the general form of the multi-target Bayes filter 
is intractable [29], the FISST framework also includes a 
number of computationally feasible approximations. Exam¬ 
ples of such filters include the probability hypothesis density 
(PHD) [25], the cardinalized PHD (CPHD) [28], the multi- 
Bernoulli (MeMBer) [29] and the cardinality balanced MeMBer 
(CB-MeMBer) [43] filters. PHD and CPHD filters are imple¬ 
mented via Sequential Monte Carlo (SMC) method [39], [44], 
[46] or using Gaussian mixtures [38], [42]. The convergence of 
PHD and CPHD filters has been studied in [2], [4], [5], [39]. 
SMC and Gaussian mixture implementations of CB-MeMBer 
filter have also appeared in [43], 

As mentioned earlier, due to the better accuracy and 
lower computational requirements of multi-Bernoulli filtering 
schemes, in applications where SMC is necessary, this paper 
focuses on an effective sensor-control solution for CB-MeMBer 
filter. The common approach to sensor-control within multi¬ 
target filtering schemes is to choose the reward/cost function in 
terms of the predicted multi-target state and the expected up¬ 
date outcomes for every admissible control command. Before 
we present our choice of cost function, the multi-Bernoulli 
filtering scheme is briefly reviewed in the next section. 

III. CB-MeMBer Filter 

A multi-Bernoulli RFS is defined as the union of M in¬ 
dependent Bernoulli RFSs X^\i = 1,..., M. Each Bernoulli 
RFS is characterized by the parameter r^ and function p^\-), 
where r ^ is the probability of existence of an element in 
(being a singleton) and p^\-) is the probability density 
function (pdf) of the state of the single element of the set 
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if it exists. It has been shown that all statistical properties 
of a multi-Bernoulli RFS can be completely characterized by 
the set of M parameter pairs, existence probabilities r^ and 
the distributions pW(-), and a general multi-Bernoulli RFS is 
commonly denoted by X ~ [29]. 

The multi-Bernoulli Bayesian filter was first derived by 
Mahler [29]. He showed that if the prior distribution of the 
multi-target random set state is multi-Bernoulli, the predicted 
and updated posteriors are also approximately multi-Bernoulli. 
Later, the original filter was shown to produce biased cardinal¬ 
ity estimates [43] and a modified version, called Cardinality- 
Balanced MeMBer (CB-MeMBer) filter, was formulated in a 
numerically tractable form and implemented by the Sequential 
Monte Carlo (SMC) [43]. 

Suppose that at time fc - 1, the posterior multi-Bernoulli 

density of the multi-target state is given by {(rEi,pE,) 

(i) 

and each p)'- { is approximated by a set of particles: 


Pk-i( x )= E w tk -\) 6 ^•«(*)• 

j=i fc - 1 


( 2 ) 


For existing and new born targets, two proposal densities are 
given and denoted by q^\-\xk-i, Zk) and b^\-\Zk). The 
predicted multi-target density (also a multi-Bernoulli density) 
is represented by the union of the survived targets and the 
newly born targets: 




[( (i) (i) 
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where existence probabilities and distributions of the predicted 
Bernoulli components are given by: 
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Let us denote the predicted multi-Bernoulli distribution by 
{(^S-i^SLi)Cr 1 - The updated multi-Bernoulli is rep¬ 


resented by the union of legacy tracks and measurement- 
corrected tracks [29], [43]: 


^ = {( r L,fc>PL,fc)} i= i* 1 u *))} zeZfc (6) 

with the following existence probabilities and singleton den¬ 
sities: 
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To avoid degeneracy, the CB-MeMBer filter also includes a 
resampling step for each track. To prevent the numerical 
explosion of the generated particles, the existence probabilities 
are thresholded and the low probability tracks are removed 
while similar ones are merged - details of these steps are 
provided in [43]. 


IV. MULTI-Bernoulli SENSOR-CONTROL 
In a multi-Bernoulli target tracking scheme, assume that 
at time k — 1, the multi-target distribution parameters are 
where M]~-i indicates maximum num¬ 
ber of targets. This distribution is propagated through 
the multi-Bernoulli prediction and update steps, and turns 
into an updated multi-Bernoulli distribution with parameters 
{rlEpE }Ei ■ As part of the multi-target tracking solution, 
at each time fc, the number and states of targets are extracted 
from the updated multi-Bernoulli parameters. We note that the 
sensor measurements are used within the update step of the 
recursion, and affect the quality of the updated distribution. 

In practice, the quality of sensor measurements usually 
depends on a sensor state (e.g. the sensor location) which is 
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assumed to be controllable, and the sensor-control problem 
is focused on choosing the command that would lead to 
the best sensor state. As it was mentioned earlier, most 
solutions are based on maximizing an information theoretic 
reward function such as Renyi divergence [20], [34], [35]. 
The main rationale behind choosing such reward functions 
is that the information encapsulated by the estimated multi¬ 
target distribution is expected to gradually increase as further 
measurements become available in time. 

In this paper, we take a different approach in which the 
updated multi-Bernoulli distribution parameters are directly 
utilized to define a new cost function. Our approach is to 
define a cost function that quantifies the average uncertainty 
in all possible multi-target state estimates after each update 
step. 2 The main difference here is that our focus is on 
measuring the quality of the updated density in terms of level 
of uncertainties, not the information gained from prediction to 
update (e.g. Renyi divergence function). 

The updated distribution depends on the measurement set 
which is a function of the chosen sensor-control command. In 
principle, the whole distribution of all possible measurement 
sets is used to compute the update distribution. However, using 
whole measurement distribution is computationally expensive 
if not intractable. Thus, to reduce the computational complex¬ 
ity, for each sensor control command, only the predicted ideal 
measurement set (PIMS) is used [27]. In PIMS approach only 
one future measurement sample set is generated for each con¬ 
trol command under the ideal conditions of no measurement 
noise and no clutter where probability of detection is equal 
to one. 3 In order to define the new cost function, we note 
that the PIMS depends on the chosen control command. For 
each command, we first compute the PIMS, then calculate an 
updated multi-object distribution ( r fc ) ’7’fe ) }f=l by considering 
the PIMS as the acquired measurement. 

A linear combination of the normalized errors of the number 
of targets and their estimated states are considered as a 
measure of uncertainty associated with estimation of the multi¬ 
target state and as the cost function: 

V(u;X k ) = r] £\ X \( U ) + i 1 * ~ v) < 4 :( w )> ( 9 ) 

where £pq(w) denotes the normalized error of estimated 
cardinality of the multi-target state and e A -(u) denotes the 
normalized error of the multi-target state estimate. The details 
of defining and computing the normalized error terms, 
and s 2 x (u) for SMC implementation are presented in Sec. V 
through equations (10)—(16). Note that rj 6 [0,1] is a user- 
defined constant parameter to tune the influence of the error 
terms on the total sensor control cost. It is also important to 
note that the expectation term in (1) does not appear as we 
use the PIMS approach [27] instead of sampling and averaging 
in measurement space. 

2 It is important to note that this cost is not totally independent of the 
prediction outcomes, since state estimates extracted from predicted multi- 
Bernoulli density are used to calculate the proposed cost function. 

3 In situations where the probability of detection is high and clutter rate is 

moderate, the multi target posterior density can be approximated under the 

assumptions of the PIMS (see [27]-section 4.1 and [33]). 


V. Implementation 


At time k - 1, the multi-target distribution is modeled by a 
multi-Bernoulli RFS with parameters {rl 1 ^ , where 

M k -1 indicates the maximum possible number of targets. 


and each p[ i3 
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the multi-Bernoulli filter propagates the multi-Bernoulli com¬ 
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As it was mentioned before, the updated distribution would 
depend on the measurement set which in turn would be 
dependent on the chosen sensor control command. Similar to 
Ristic et al. [33], for each control command, a single pseudo¬ 
measurement is generated for each single object under ideal 
conditions (perfect detection, zero measurement noise and no 
clutter). The ensemble of such pseudo-measurements forms the 
PIMS. To compute each of the ideal measurements, we need to 
know the single-target states which can be estimated from the 
predicted multi-target distribution via the following two steps. 4 
First, the estimated number of targets, denoted by Mk\k-i 
is obtained by counting the predicted existence probabilities 
that exceed a given threshold (e.g. 0.5 as an unbiased value). 
Then, the EAP estimate of the /:-th target state is given by 


4.W _ y 

•^’-'k -1 1 


k \ k 1 where the indices i? refer to 

3 = -L k\k-l k\k-l 42 


k\k-l ^j =1 k\k-l 

the Bernoulli components that were counted in step I. 

Having the ideal measurement set for each admissible 
sensor command, u k e U/,, and considering it as the actual 
measurement, we can then run the update step and calculate 
the cost corresponding to that command. The cost defined 
in (9) combines two normalized error terms, e? x ^u) for 
the cardinality estimate and e A -(u) for the multi-target state 
estimate. Both terms depend on the updated multi-object 
posterior ,P^\ lk (-)}i = i’ Uk which in turn depends on the 

PIMS computed for the command u k . 

In [7] and [8], Delande et al. showed that the regional 
variance of the number of targets quantifies the certainty of 
the filter estimates of the number of the targets that evolve 
in surveillance region. Following [8], we chose the variance 
of the cardinality as a meaningful measure for its uncertainty 
or estimation error. In terms of the updated probabilities of 
existence this variance is given by: 


2 / \ x Mk - 

°m( Wfc ) = £ i= i 


Kkp-’•“»)]• < 10 > 


It is important to note that in the linear combination 
presented in equation (9), in order for the weight rj to be 
dimensionless and bounded within the [0,1] interval (so that 


4 It is important to note that same procedure is used to finally extract the 
multi-target estimates after the control command is chosen and applied, the 
measurements are acquired from sensors and the multi-target state is updated. 
Furthermore, we note that alternative methods (such as the method suggested 
in [43]) could be also used for this purpose. 
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both r/ and 1 - 77 weights are positive), the two error terms 
have to be dimensionless. We choose “normalized” error 
terms which are both dimensionless and bounded within [ 0 , 1 ] 
themselves. Thus, the linear combination forming the cost 
in (9) will be guaranteed to be bounded within [0,1]. The 
normalized cardinality error term can be computed as follows: 

ji 


\x 


( Uk ) = 


' 1*1 


( Uk) 


max{<7j^|(w.fc)} 


( 11 ) 


The maximum variance occurs when Vi,r ^ = 0.5. 

’ K.Mfs 


max{< 7 i 2 Y i(wfc)} = 
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To arrive at a meaningful measure for the normalized state 
estimation error term e 2 x (u) in the cost defined in (9), we first 
compute the following total state estimation error: 


2 / \ _ 
£ X\ U k) ~ £i=l 


f (*) 2 / \] / 

[ r k'u k ^y u k)\ / E i= i 


(0 
r) ’ 

k,Uk 


( 12 ) 


which is the weighted average of estimation errors of the 
states of single targets associated with each single Bernoulli 
component. The rationale behind this choice of weights is that 
Bernoulli components with larger probabilities of existence 
contribute more strongly to the EAP estimate of the multi¬ 
object state -see section IV-A.4 in [43], Although e\ is not 
directly related to estimation error, we named it as the “total 
state estimation error”. The philosophy behind this choice lies 
in the fact that reduction of <; 2 (i) increases the confidence of 
estimation which implicitly increases the expected accuracy of 
estimation. 

To compute the single Bernoulli component errors f 2 (t) (uf.fi 
we note that in practice, minimizing the estimation error of 
only some selected elements of target states maybe of the 
interest. For instance, in target tracking, usually the prime 
interest is in location, and target speed is included in the 
single-target state vector due to its appearance in motion and 
perhaps measurement models. In such target-tracking appli¬ 
cations, an intuitive scalar measure for the single Bernoulli 
component error is given by the product of the variances of 
the target location coordinates. If the stochastic variations of 
target location coordinates are independent, this measure will 
translate into the absolute determinant of the covariance matrix 
of the target location. In case of tracking multiple-targets in 
two-dimensional space, the single Bernoulli component error 
term, <; 2 (i) (u k ), is given by: 


^(i) ( Uk ) cr 2 (l) (u fc ) cr 2 (l) (u k ) (13) 


where x and y denote the x and y-coordinates of the single¬ 
target location (part of its state vector x ), and the proportion¬ 
ality (not equality) is chosen because the above product does 
not lead to a normalized measure. We note that in theory, 
a very small cr 2 (i) (uic) and an unacceptably large a^fiuk) 
can still lead to a relatively small ^^(ufi). However, a near 
zero determinant of covariance matrix for target locations 
would contradict our independence assumption for the target 
movements. A remedy would be to take the sum of normalized 
variances along the coordinates. But unlike the product, this 
sum would have no meaningful interpretation. Having the 


updated particles and weights of each Bernoulli component, 
the single-coordinate errors can be calculated as follows: 
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where x£*’^ and denote the coordinates extracted from 

k,Uk J k,Uk 

the particle x^fi^ and power operation is element-wise. To 
normalize the total state estimation error term <r 2 ( i)(itfc) 
in (13), we note that with equally weighted particles, i.e. when 
= 1/L^ u , the particles representing the «-th single 
Bernoulli component do not convey any information and the 
above estimation variances adopt their maximum values as 
follows: 
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The single Bernoulli error terms fi 1 ^ (uf ; ) in (13) can be 
normalized as follows: 


SrG) ( M fc) 


CT 2 (l) (ltfc) CT 2 (l) (Mfc) 
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(16) 


and the computed values are then used in ( 12 ) to calculate the 
normalized state estimation error term in the cost. We note that 
the product of variance terms in (13) is a squared variance, 
but after normalization, it would be still a normalized measure 
for confidence of estimation. Extension of the terms derived 
in (13)—(16) to the cases involving more than two dimensional 
location parameters is straightforward. 

Having the cost values computed for all admissible sensor 
control commands, the best command u k is then chosen as 
the one incurring the smallest cost: 


u k = argminV(u fe ; X k -f). (17) 


Algorithm 1 shows the pseudocode for the CB-MeMBer 
multi-target filtering scheme with the proposed sensor-control. 


VI. Simulation Results 

The performance of the proposed PEECS sensor-control 
method was evaluated in two challenging case studies. Case 
study 1 is borrowed from [35] and [14] and used to compare 
the performance of PEECS with the PHD-based sensor control 
method for different clutter rates. Case study 2 is a more 
challenging extension of case study 1 in which the targets 
are not pseudo stationary (they move and turn through the 
scenario), the motion model is more complex, and in addition 
to range, bearing measurements are also available. 
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Algorithm 1 The CB-MeMBer multi-target filtering recursion 
with PEECS sensor-control._ 

INPUTS: time k, dynamic model fk\k-i(‘\ x k-i), multi-Bernoulli birth model 
parameters, prior multi-Bernoulli parameters from time k — 1, detection probability 
Pd('), measurement likelihood function gk(-\x), and clutter intensity i/(-) and 
its integral A c , current sensor(s) location(s), finite set of admissible sensor-control 
commands HJ. 

Output: The best control command u and updated multi-Bernoulli parameters. 


Prediction: 

1: Using equations (1)—(3), compute the predicted multi-Bernoulli component param- 
eters and their particles {rj^ . { w k\k-l ’ x k\k-l} • 


Pre-estimation: 

2 : £<- 0 , 

3: for 2 = 0, M k \ k _ 1 do 
4: if r^ k _ 1 >0.5 then 

5: l<-£+l 

C, ~(£) ^k\k-l ( i,j ) (i,j) 

V. X u\i i i W u\u\ X u\u 1 

re|fc—1 ' J = 1 1 1 

7: end if 

8: end for 

9: M k \k-i*-t 

Sensor-control: 

for all life e U do, 

Z <- 0 . 

for t = 1, M k \k-i do 

C «- argmax z g(z\x[y k l ) 

Z^Zu{i} 

end for 


10 

11 

12 

13 

14 
15: 

16: 

17: 

18: 

19: 

20 : 

21 : 

22 : 

23: 

24 

25 

26 


[> Constructing the PIMS: 


[> Update for sensor control: 
Use the PIMS Z in (7)-(8) to update the multi-Bernoulli distribution parameters. 

[> Computing the cost: 

e^(u k ) «- fc (l-r« fc )]/^i.. 

r.W 




v<0 


(n k ) «- 


v k, Uk ^k,v. k 


K,U k K,U k 


Ji) 

w k,n k x k ,u k 


E,-. 


“ktykt 

K > u k K ’ u k 


max {^(.)(»l)} ♦-(TJ— (4-(T5—) £ 3=1 


(0 

x (i.d) 

X k,u. k 

^ .. 2 
maxflji) (“fc)> - 77II— U - 77^) y k ,'« k 


xW 


( u k) ■ 


x(0 


( u fc) 1 


y (0 


(«fc) 


max{ff 2 ^.j (w fe )} max{ff^.^(u fe )} 


V(u; A7) *- r) Ep Y |(«) + (I- 17 ) 4(“). 

end for 

n <- argmin u V(u; Xfc) 


fc .“fc -CO 

k,U k 


Measurement: 

27: Apply the control command n to change the sensor state. 
28: Read the actual measurement set Z k . 


Update: 

29: Use the set Z k as measurement set in equations (7)-(8) and compute the updated 
multi-Bernoulli parameters. 

30: Prune and merge the updated Bernoulli components. t> More details in [43]. 


A. Case study 1 

A controllable range-only sensor is used to track multiple 
targets in a surveillance application. Each single target state 
is comprised of location and velocity components in x and y 
directions, denoted by [x y x y] T . In the particular scenario 
borrowed from [14], [35], there are five targets. The sensor 
location is denoted by s = [x s y s ] T . The sensor can detect an 
object in location o = [x Q y 0 ] T with the following location- 



Fig. 1. Sensor locations for PEECS cost function and MAP cardinality 
variance cost function [14] during k = 1,..., 20 for range-only measurement. 


dependent probability: 

/ \ f 1 , if |]o-s|] < Ro 

Pd{s,o) < max{ 0,1 - h(||o - s]| - f?o)} . otherwise 

(18) 

To make the results comparable, we used the same parameters 
used in [14], [35]. Those are as follows: 

- surveillance area: 1000 m x 1000 m, 

- Rq = 320 m, 

- h = 0.00025 m- 1 , 

- measurement model: 2 = ||o - s|| + e where e ~ N ( 0 , a 2 ), 
a = cr 0 + /3]|o - s]] 2 , do — 1 m and fj = 5 x 10 -5 nr -1 . 

The Poisson RFS for clutter has the intensity v(z) = A c c(z) 
where c(z ) = t/[0,1000^2], and A c = 0.5. 

Overall, there are five objects in the surveillance area, 
positioned relatively close to each other. Their initial state 
vectors are: [800 600 1 0] T , [650 500 0.3 0.6] T , [620 700 
0.25 0.45] T , [750 800 0 0.6] T , and [700 700 0.2 0.6] T , where 
the units of x and y are meters and x and y are m/s. The 
objects move according to the constant velocity model [9], 
[14], [22], [35], 

The controllable mobile sensor initially enters the surveil¬ 
lance area at position s = [10m 10m] T . Other parameters 

such as the dynamic parameters of the motion model, proba¬ 
bility of survival, target birth model, finite set of admissible 
control commands, and relative number of particles are also 
borrowed from [14], [35], As it is also noted in [35], the 
range dependent sensor noise and moderate rates of clutter and 
miss-detections make this a very challenging case of sensor- 
control for multi-target estimation in which many state-of-the- 
art techniques would fail. 

Figure 1 shows the initial and final locations of the five 
targets in this scenario, and demonstrates how the sensor 
location is controlled towards locations with better accuracy 
and detection rate by PEECS method (Algorithm 1). We have 
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Time 

(a) OSPA cardinality error 



(b) OSPA metric error 


Fig. 2. Estimation errors of the PEECS sensor-control method compared to 
the PHD-based sensor-control. 


compared PEECS sensor-control with recent PHD-based sensor 
control methods in terms of their OSPA errors. In addition, we 
compared our method with the multi-Bernoulli-based sensor- 
control method of Hoang et al. [14] which is more similar. 
Figure 1 shows the controlled sensor locations of this method 
compared to PEECS sensor-control method. We will further 
elaborate on this comparison later. 

We computed the localization, cardinality and OSPA er¬ 
rors [36] at time steps k = 1,..., 35. The estimation errors 
for three sensor-control methods using different reward/cost 
functions (PENT [24], Renyi divergence [35] and PEECS) are 
shown in Fig. 2 for the case of clutter rate A c = 0.5. Note that 
OSPA errors for Renyi divergence are computed and plotted 
for a = 0.5 and a = 1. We observe that during the initial time 
steps, the OSPA error returned by the PEECS sensor-control 
is significantly smaller than (about half of) the error of the 
two PHD-based sensor-control methods reported in [35] which 
have been demonstrated to outperform PENT and other existing 
methods. However, as it is reported in [35], the reason of poor 
performance of PENT lies in the fact that its objective function 
is designed for sensor-control problems in which the sensor 
has finite field of view, which is not the case in this scenario. 

Although PEECS and PHD-based methods converge to simi¬ 
lar error values, the initially smaller error of the PEECS sensor- 



■ -O - Renyi-PHD A=5 ■ -O - Renyi-PHD A=3-Renyi-PHD A=0.5 

—e— PEECS A=5 — 6— PEECS A=3 -PEECS A=0.5 


Fig. 3. Estimation errors with different clutter rate of our sensor-control 
method compared to the PHD-based sensor-control . 


control demonstrates that the sensor is guided towards its 
optimum location faster and arrives there much earlier than 
the PHD-based methods that work based on maximizing Renyi 
divergence. A video of the live simulation run is attached as 
supplemental material. It is observed from the run that after 
the first 13 time steps, our sensor-control drives the sensor 
to a close vicinity of the center point of the five targets, and 
during the rest of the simulation, the sensor stays in that region 
with minor movements. This observation is consistent with 
the OSPA errors of PEECS shown in figures 2(a) and 2(b) 
converging to its minimum at about time step 13. 

To investigate the robustness of PEECS sensor-control to 
increased levels of clutter, we compared its OSPA errors 
with the PHD-based sensor-control method reported in [35] 
for different clutter rates of A c = 0.5,3,5. The errors were 
averaged over 200 Monte Carlo runs of each method and 
are shown in Fig. 3. It is observed that by increasing the 
clutter rate A c from 0.5 to 3 then to 5, our proposed method 
returns OSPA errors that are generally smaller than the errors 
returned by the PHD-based sensor-control. We also note that 
the presented OSPA errors in the paper only provide a broad 
indication of sensor-control performance, and such results are 
scenario dependent. 

The most significant advantage of PEECS sensor-control 
method is its low computational cost. Table I shows the 
computation times of PEECS sensor-control compared with 
the PHD-based sensor-control method of [35], averaged over 
200 Monte Carlo runs, for different clutter rates. 5 We observe 
that in average, PEECS sensor-control runs at least 4 times 
faster than competing methods. Note that the result of the 
computation times should only be considered as a broad 
indication of sensor-control performance, since these results 
are scenario and code dependent. We also note that PEECS 
does not consider possible constraints on the sensor field of 
view and needs to be revised in presence of such constraints. 
Comparison of the computational cost of the revised version 
of PEECS with methods like PENT [27] (that take such limits 
into consideration) is beyond the scope of this paper and left 

5 Both methods were coded in MATLAB and run on the same machine. 
For the PHD-based sensor-control, we used the code provided by the authors 
of [35], 
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TABLE I 

Comparison of computation times (seconds) 


Clutter rate (A c ) 

0.5 

3.0 

5.0 

PEECS 

0.310 

0.333 

0.373 

PHD-based method [35] 

1.336 

1.631 

1.973 


for future research. 

The lower computational cost of PEECS sensor-control is 
mainly due to the fact that it is designed to work with the 
multi-Bernoulli filter. Extraction of cardinality and single¬ 
object state estimates in a multi-Bernoulli scheme is straight¬ 
forward compared to PHD and CPHD methods. In the latter, 
the particles approximating the intensity function need to 
be grouped via a clustering algorithm. However, no such 
clustering step is required by multi-Bernoulli methods, hence 
substantial savings in computation is made particularly in 
applications that involve numerous objects. In such applica¬ 
tions, the total number of particles, representing Monte Carlo 
approximations of the intensity function, is required to be very 
large otherwise substantial amount of information would be 
lost during the clustering step. The need for a large number 
of particles would impose heavy computation burden in those 
filtering schemes. 

In Fig. 4, OSPA error of PEECS sensor-control method is 
compared with the method reported in [12], [14]. Hoang et 
al. [14] used multi-Bernoulli filter in conjunction with Renyi 
divergence as a reward function. We observe that PEECS 
sensor-control performs better than the Renyi divergence- 
based sensor control method proposed in [12], [14] in terms 
of OSPA error. This is mainly because PEECS cost function is 
structurally related to the error terms computed in OSPA. We 
note that the superior performance of PEECS in terms of OSPA 
error may not be necessarily valid in terms of information gain 
which is the major focus of Renyi divergence-based sensor 
control methods. 

Hoang and Vo [14] also proposed a multi-Bernoulli sensor- 
control method in which the variance of cardinality around 
its MAP estimate (termed “MAP cardinality variance”) is used 
as the cost function. This cost function is similar to PEECS. 
However, in PEECS, the variance term is the statistical variance 
(around the mean, not around the MAP estimate) of cardinality, 
and more importantly, the cost is penalized due to errors not 
only in cardinality estimates but also in state estimates. As it is 
shown in Fig. 4, our method outperforms the method reported 
in [12], [14] in terms of OSPA errors.. 

The controlled sensor locations resulted by PEECS and the 
method reported in [14] (with MAP cardinality variance as 
the cost function) are shown in Fig. 1. We note that in 
the case studies discussed in this paper, some measurement 
parameters depend on the sensor-target distance in such a 
way that a shorter sensor-target distance generally leads to 
more accurate measurements. To be more precise, based on the 
measurement models in case study 1, with a shorter sensor- 
target distance, higher probability of detection and smaller 
measurement noise are achieved. This seems to be a commonly 
accepted assumption as it is reported in many similar works 



Fig. 4. The comparative results for PEECS method and the methods reported 
in [14]. In this case, clutter rate is burrowed from [14] to be A c = 0.5. 

on sensor control such as [14], [34], [35]. With such a 
distance-dependent measurement model, an effective sensor 
control method is expected to guide the sensor towards the 
center of the targets, and maintain it in the vicinity of the 
center as the targets move. Figure 1 clearly shows that PEECS 
directly moves the sensor towards the center of the targets and 
maintains it in the center as the targets move. This is while 
the sensor trajectory created by the competing method of [14] 
seems to be deviated from the path to the center of the targets 
and not to converge and remain in the center as the targets 
move. Hence, PEECS leads to the significantly improved OSPA 
errors shown in Fig. 4. 

As stated by Hoang and Vo [14], “the lack of observability 
of the full states, as we now have range-only measurements,” 
causes the poor performance of their proposed sensor control, 
especially when only the variance of cardinality around its 
MAP estimate is optimized. If the objective function “ac¬ 
counts for both localization and cardinality criteria”, which 
is indirectly the case when Renyi divergence is chosen as 
the objective function, sensor control is expected to result in 
“lower error” [14]. We note that both cardinality and state 
estimation errors are explicitly considered in PEECS objective 
function. 


B. Case study 2 

To demonstrate the performance of our method with mea¬ 
surements that guarantee full observability of the targets, and 
to investigate the particular effect of the localization error term 
in the proposed cost function, we also designed a simulation 
involving a more complex scenario than case study 1. In 
this scenario, we chose the non-linear nearly-constant turn 
model employed in [43]. In this model, each single target 
state x - [x T w] T is comprised of location and velocity in 
Cartesian coordinates, denoted by x = [x y x y] T and turning 
rate, denoted by oj. The state dynamics are given by: 

x k = F(uj k -i)x k -i + Ge k - i, 

Wfc = Wfc-i + T"/k-i, 
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where 


"1 

0 

sin cjT 

1—cos ujT ~ 


- rj-i2 

0 1 

0 

1 

1—cos luT 

UJ 

sincuT 

UJ 

,G = 

0 

~2 

0 

0 

cos ujT 

- sin col 

T 

0 

_0 

0 

sin ujT 

cos ujT 


. 0 

T . 

e/c-i 


■V(-;0,o- 2 

/), ay = 15 m/s 2 

, and 

Ik- 


Af(-; 0, o' 2 /), a 1 - (7r/180) rad/s. The birth RFS is a multi- 
Bernoulli with density tty - where = 

rp 2 ' = 0.02, Tp 3 ^ = rp 4 ) = 0.03 and p^\x) = N{x\rri^\P 1 ) 
where 


m\ L) = [-1500 0 250 0 0] T , 

rrSf 1 = [-250 0 1000 0 0] T , 

to( 3) = [250 0 750 0 0] T , 

mf = [1000 0 1500 0 0] T , 

P 1 = diag(50 2 ,50 2 ,50 2 ,50 2 ,(6 x I | s ) 2 ). 


Probability of survival, detection probability, initial sensor 
location and clutter rate are similar to the previous case study. 
In this case study, in addition to range, bearing information 
is also available and the observation model consists of noisy 
bearing and range measurements: 


z k = [arctan(^) y/ x fc + Afc] + Ck> 

where £ k ~ 7V(-; 0, R k ) is the measurement noise with co- 
variance R k = diag(aj[, ct 2 ) in which the scales of range 
and bearing noise are erg = (7r/180)rad and ay = 5 m. 
The clutter RFS followed the uniform Poisson model over 
the surveillance region [-7 t/2 , 7t/2 ] rad x [0, 2000] m, with 
intensity of 1.6 x lCT 3 (radm) _1 which is equivalent to a 
clutter rate of A c = 10. 

As it is shown in Fig. 5, the sensor starts moving toward 
the objects and remains between those. We ran PEECS sensor- 
control algorithm for both 77 = 1 and // = 0.5. When the 
parameter // equals 1, the cost function includes only the 
cardinality error term. With r] = 0.5, both cardinality error 
and object state estimation error are equally weighted within 
the cost. We recorded the OSPA errors in both cases at time 
steps k = 1,...,50. The results are plotted in Fig. 6. this 
figure shows that although the targets maneuver in long paths 
(compared to the case study 1 borrowed from [35]), they are 
tracked with reasonably low error, indicating the reliability of 
PEECS. Comparison of OSPA errors in cases where rj = 1 or 
77 = 0.5 also demonstrates the benefit gained in terms of total 
error when both cardinality and state estimation errors were 
considered within the cost function, as it happened in PEECS 
with rj - 0.5. 


VII. Conclusion 

A sensor-control solution was proposed to be employed 
within a multi-Bernoulli multi-target filter. In this method, 
at each step, the next sensor control command is chosen by 
minimizing a new task-driven cost function called “Posterior 
Expected Error of Cardinality and States” (PEECS). The PEECS 
cost associated with each control command is defined as a 
linear combination of the normalized errors in cardinality 



x coordinate (m) 

□ Sensor locations • Target at k = 1 ■ Target at k = 50 

□ Sensor locations • Target at k = 1 ■ Target at k = 50 


Fig. 5. Sensor and target locations in case study 2, during k = 1,..., 50. 



Fig. 6. OSPA errors returned by PEECS in case study 2. 


estimate of the multi-target random finite set and the normal¬ 
ized error of localization of the elements in the multi-target 
RFS. Simulation results involving two challenging multi-target 
estimation and sensor-control scenarios, demonstrated that 
PEECS sensor-control can return multi-object state estimation 
accuracy and clutter tolerance that are similar to or better than 
competing methods, and generally performs faster than the 
state-of-the-art. 
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